Document Analysis System

نویسندگان

  • Kwan Y. Wong
  • Richard G. Casey
  • Friedrich M. Wahl
چکیده

This paper outlines the requirements and components for a proposed Document Analysis System, which assists a user in encoding printed documents for computer processing. Several critical functions have been investigated and the technical approaches are discussed. The first is the segmentation and classijication of digitized printed documents into regions of text and images. A nonlinear, run-length smoothing algorithm has been used for this purpose. By using the regular features of text lines, a linear adaptive classification scheme discriminates text regions from others. The second technique studied is an adaptive approach to the recognition of the hundreds of font styles and sizes that can occur on printed documents. A preclassifier is constructed during the input process and used to speed up a well-known pattern-matching method for clustering characters from an arbitrary print source into a small sample of prototypes. Experimental results are included.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Document Analysis And Classification Based On Passing Window

In this paper we present Document analysis and classification system to segment and classify contents of Arabic document images. This system includes preprocessing, document segmentation, feature extraction and document classification. A document image is enhanced in the preprocessing by removing noise, binarization, and detecting and correcting image skew. In document segmentation, an algorith...

متن کامل

Explaining the Components of Moral Education of Learners and Analyzing its Position in Fundamental Reform Document of Education

The aim of this study was to identify the components of moral education of learners in the higher-tier documents of the country’s educational system and to determine the level of attention paid to these components in the Fundamental Reform Document of Education. In this research, a qualitative-deductive content analysis method was utilized. For this purpose, the unit of analysis was all the sen...

متن کامل

Evaluation of EAP Programs in Iran: Document Analysis and Expert ‎Perspectives

This study aimed to examine the policies in the Iranian English for Academic Purposes (EAP) education and the extent to which objectives match the policies and are materialized in practice. To this end, course descriptions in the syllabi for the EAP programs were evaluated through document analysis and triangulated with the experts’ perspectives through interviews to examine the current status ...

متن کامل

A policy framework for the challenges of implementing regional higher education management in Iran

The models of regional governance in the world, particularly for administration of higher education are considered vital. In Iran, with the approval of Iran's Higher Education System Spatial Management Document, the issue of regional management in higher education was given special attention. Articles 1 and 2 of the document specifically address the regional higher education structure of the ...

متن کامل

Evaluation of Health System Development Plan and Basic Education Transformation Plan Based on Health System Assumptions with Emphasis on Education

Background and Objective: Health education and health promotion are considered an important source for economic, social and individual development. It is the governments’ important role to consider it as a crusial matter and all human beings need training to achieve this worthwhile goal, namely health. Methods: This study was carried out using content analysis “Shannon Entropy”. In this method ...

متن کامل

Information and data flow analysis for forestry sector in Iran as a basic requirement for designing a forest information system (FIS)

ABSTRACT The aim of this study was to evaluate the status of information on forest and data transfer and to identify the gaps in information and data flow in forestry sector in Iran. The study evaluated the data and information flow in three levels (control offices level, provincial offices level and organizational offices level) using the document analysis and questioning (interviews and ques...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IBM Journal of Research and Development

دوره 26  شماره 

صفحات  -

تاریخ انتشار 1982